Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
Size:
None
Production Status:
Newly created-in progress
Use:
Dialogue
- Paper title: Predicting Ratings of Real Dialogue Participants from Artificial Data and Ratings of Human Dialogue Observers
- Paper track: Evaluation/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kallirroi Georgila | ICT corpus of Internet of Things dialogues | /N |
Documentation:
None
Written
Evaluation Tool,
Language Type:
Bilingual
Languages:
English Italian
Availability:
From Owner
License:
Size:
3000 sentences
Production Status:
Newly created-finished
Use:
Evaluation/Validation
- Paper title: Multiword Expression aware Neural Machine Translation
- Paper track: Evaluation/poster presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Andrea Zaninello | test set for NMT MWE evaluation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
Size:
4.24 GByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Evaluation of Off-the-shelf Speech Recognizers Across Diverse Dialogue Domains
- Paper track: Evaluation/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kallirroi Georgila | ICT corpus for speech recognition evaluation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Basque Breton Catalan Chinese English
Availability:
Freely Available
License:
CC0
Size:
2,500 hours
Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
- Paper title: Common Voice: A Massively-Multilingual Speech Corpus
- Paper track: Speech/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Josh Meyer | Common Voice | /N |
Documentation:
https://voice.mozilla.org/en/datasets
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Spanish Italian
Availability:
Freely Available
License:
<Not Specified>
Size:
799 sentences
Production Status:
Newly created-in progress
Use:
Named Entity Recognition
- Paper title: Building Named Entity Recognition Taggers via Parallel Corpora
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 2 | Yiling Chung | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 3 | Itziar Aldabe | University of the Basque Country (UPV/EHU) | ES |
| Author 4 | Nora Aranberri | University of the Basque Country | ES |
| Author 5 | Gorka Labaka | University of the Basque Country (UPV/EHU) | ES |
| Author 6 | German Rigau | UPV/EHU | ES |
| Main Contact | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | None |
Documentation:
https://github.com/ixa-ehu/ner-evaluation-corpus-europarl/
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Czech English German French
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Improving Neural Machine Translation by Incorporating Hierarchical Subword Features
- Paper track: NLP engineering experiment paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Makoto Morishita | NTT Communication Science Laboratories | JP |
| Author 2 | Jun Suzuki | NTT CS Lab. | JP |
| Author 3 | Masaaki Nagata | +81-774-93-5235 | JP |
| Main Contact | Makoto Morishita | NTT Communication Science Laboratories | None |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English Mandarin Chinese Russian Spanish Standard Arabic
Availability:
Freely Available
License:
<Not Specified>
Size:
600
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: The AMARA Corpus: Building Parallel Language Resources for the Educational Domain
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ahmed Abdelali | Qatar Computing Research Institute | QA |
| Author 2 | Francisco Guzmán | Qatar Computing Research Institute | US |
| Author 3 | Hassan Sajjad | Qatar Computing Research Institute | QA |
| Author 4 | Stephan Vogel | Qatar Computing Research Institute | QA |
| Main Contact | Ahmed Abdelali | Qatar Computing Research Institute | None |
Documentation:
English
Language Type:
Multilingual
Languages:
English Estonian German Latvian
Availability:
Freely Available
License:
Open Source
Size:
<Not Specified>
Production Status:
Newly created-in progress
Use:
Bilingual Dictionaries
- Paper title: Bilingual dictionaries for all EU languages
- Paper track: Multimodality
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ahmet Aker | University of Sheffield | GB |
| Author 2 | Monica Paramita | University of Sheffield | GB |
| Author 3 | Marcis Pinnis | Tilde | LV |
| Author 4 | Robert Gaizauskas | University of Sheffield | GB |
| Main Contact | Ahmet Aker | University of Sheffield | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English German Spanish French
Availability:
From Owner
License:
<Not Specified>
Size:
334.4 MByte
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jérémy Ferrero | Université Grenoble Alpes | FR |
| Author 2 | Frédéric Agnès | Compilatio | FR |
| Author 3 | Laurent Besacier | LIG | FR |
| Author 4 | Didier Schwab | Univ. Grenoble Alpes | FR |
| Main Contact | Jérémy Ferrero | Université Grenoble Alpes | None |
Documentation:
<Not Specified>
Written
Evaluation Package,
Language Type:
Multilingual
Languages:
English Galician Portuguese Spanish
Availability:
Freely Available
License:
GPLv3
Size:
5.3 MByte
Production Status:
Newly created-in progress
Use:
Named Entity Recognition
- Paper title: Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marcos Garcia | University of Santiago de Compostela; Universidade da Corunha | ES |
| Main Contact | Marcos Garcia | Universidade da Corunha | None |
Documentation:
README.txt




